New Invariants for the Graph Isomorphism 

Problem 



Alexander Gamkrelidze, Giinter Hotz, and Levan Varamashvili 

1 Department of Computer Science, 
Tbilisi State University, 
Building XI, Room 355, 

Tbilisi, Georgia 
2 University of Saarland 
Campus E 1.1, Room 311 
Department of Computer Science 
alexander . gamkrelidzeOt su . ge 
g . hotzOrz . uni- sb . de 
varlevani@gmail . com 



Abstract. In this paper we introduce a novel polynomial-time algo- 
rithm to compute graph invariants based on the modified random walk 
idea on graphs. However not proved to be a full graph invariant by now, 
our method gives the right answer for the graph instances other well- 
known methods could not compute (such as special Fiirer Gadgets and 
point-line incidence graphs of finite projective planes of higher degrees). 
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1 Introduction 

The computational complexity of graph isomorphism remains unresolved for 
over three decades now. No polynomial-time algorithm deciding whether two 
given graphs are isomorphic is known; neither could this problem be proved to 
be NP-complete. Besides its practical importance, the solution of this problem 
would give us a deeper insight into the structure of complexity hierarchies (in 
case it is NP-complete, the polynomial hierarchy would collapse to its second 
level |15j . [3]). The counting version of Graph isomorphism problem is known 
to be reducible to its decisional version [11] . while for all known NP-complete 
problems their counting versions seem to be much harder. On the other hand, 
the known lower bounds in terms of hardness results for Graph isomorphism 
problem are surprisingly weak. Because of such theoretical results and of many 
failed attempts to develop an efficient algorithm for graph equivalence or prove its 
NP-completeness, it could be one of the intermediate problems that are neither 
in P nor NP-complete — P^ NP assumed. 

Despite these facts, efficient algorithms are known for several special classes of 
graphs such planar graphs |16j . [7J, random graphs [jQ, graphs with bounded 
eigenvalue multiplicity [2 , graphs of bounded genus [5] and graphs of bounded 



2 Alexander Gamkrelidze, Giinter Hotz, and Levan Varamashvili 

degree [9]. In some cases, like trees [8], [4], or graphs with colored vertices and 
bounded color classes [10], even NC algorithms for isomorphism exist. An inter- 
esting result is due to Erdos et al. [T] who showed that the isomorphism prob- 
lem is easily solvable by a naive algorithm for "almost all" graphs: only a small 
amount of special graphs make all known algorithms impracticalble. Surprisingly, 
it is very hard to find graph instances that cause problems for all known graph 
isomorphism systems. Only highly regular graphs such as Fiirer gadgets, point- 
line incidence graphs of finite projective planes or graphs for Hadamard matrices 
cause problems to even leading graph-isomorphism solvers such as " nauty" |12j . 
In this paper, we introduce a novel algorithm for graph isomorphism based on a 
modified random walk approach that is due to the second author. For each graph 
with n vertices we build a set of n(n — l)/2 quadratic polynomials that builds 
a graph invariant. The upper bound of the time complexity for the developed 
algorithm is 0(n 5 ), but in most cases it can be reduced to 0(n A ). 
At the end we give computational results - tables of the running times of our 
program on some difficult graphs (special Fiirer gadgets and point-line incidence 
graphs of finite projective planes of higher degrees). 



2 Basic Ideas 

The idea of the modified random walk is due to the second author. We define 
the random walk on graphs as follows: given a graph G — (V, E) with vertex 
set V and the set of edges E if at time t.j we are in a vertex v G V, at the next 
moment ti+i we decide to move to one of the neighboring vertices or to stay 
in the actual vertex (here and further in this work we consider discrete time). 
The probability to stay in the actual vertex Vi be Pi and the probability to move 
from vertex Vi to Vj be pij. If we decide to change the vertex, the probabilities 
to choose one of the neighbors are equal and we get the following formula: 

Pl - J ~\0 else 
where d(vi) is the degree of u, G V. 

Choosing different probabilities pi for staying in a particular vertex Vi we can 
vary the changing probabilities pij thus producing different directed weighted 
graphs that originate from G with the connection graph M' G = (m! i where 
Tn'ij is the changing probability from Vi to Vj if i ^ j and the staying probability 
else. 

Dividing each row by we can normalize M' G and get another connection 
matrix Mq — (rriij)" where 

v Pid(vi) 

mi.i = A 2 = 

1 — Pi 



and for % ^ j 
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1, if (Vi,Vj) £ E 
else 



Thus for a given graph G we get a connection Matrix M G as usual with variables 
Xi at the diagonal. 

up to the positions = rrijj 



Now consider a connection matrix M% (rj) with fixed values 77 at the diagonal 



= X: 



M G ' 5 (v) 



(ril 1 
1X001 




1 1 
1 

V01 



10\ 
1 

77 1 1 1 

1 77 1 1 
11X01 
10 77 1 
11 1 77/ 



(l 1 1 1 0\ 
110 10 1 
11110 
1110 1 
111110 1 
10 10 11 

\0 101111/ 



det(M^) 



\Mg\ 



Let P^iX) 
with the diagonal elements 1. 
It is easy to check that Mq 3 (1) 



and Aq be the usual connection matrix for G 



(\B G j \ 



m\)x 



\C G 3 \ 1 where 



G 3 is the matrix Aq with ith and jth raw and column eliminated; 

G 



A, 

B% 3 is the matrix Aq with ith raw and column eliminated and ma = 0; 



— CA J is the matrix Aq with m 



m 



3 j 



A u= 



A 



44 — 14 
B 



c 



Fig. 1. Figure Example 



Since Pq J (1) = |Ag|, the polynomial P G 3 (1) — |Ag| is divisible by X — 1 and we 
get: 

l g \x) = p G J a)ji^i = + (|Ag] _ i^i) 

(this can be easily checked using the equation \A G \ — \Cq\ — \A G 3 \ = \B G 3 \ + 

"in 

Hence, 

% G = \(\A G j \) n I and € G = \(\C l 6 k \Y \ 
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are graph isomorphisms (here and further in this work we set n = \V\ as the 
number of vertices of G) . 

Note that if 2l G 7^ 2t<j or 7^ * ne graphs G and G" are definitely not 
isomorphic. On the contrary, if 2l G = 2t' G and — £ G , these two graphs are 

not proved to be equivalent. But if we regard f|j4jj'|) and (|C G ,fe |) 

V / i.J — l V / i,j=l 

as the connection matrices for some weighted graphs, we can repeat the whole 
method recursively by setting Aq = I \A l G \ ) and Ac = ( \A G J ,\ ) 
We summarize the above ideas in the following algorithm: 



Algorithm Graphlsomorphism 

Input: connection matrices Aq and Ac of the graphs G and G'; 



Repeat the following code twice 
{ 

compute the matrices (|^4 G '|) and ; 

If (the sets of entries of these matrices are not equal) 
G ^ G 1 and stop; 
else If (2l G ^ 2t G 

G^G' and stop; 
else If (£ G ^ £ G 

G^G' and stop; 

Set Aq = (\A l G J \) l i=i and A G , = (\A G 
} 

G = G' 



Obviously, the complexity of this method is 0(n 5 ), but using the special methods 
(computation of the inverse matrix of Aq in case det(Ao) ^ and using its 
symmetry) the upper bound can be reduced to 0(n 4 ). 

Another problem that arises during the computation is that the determinants 

I ( \ A l G \ ) I and | ( \ A G J \ ) | can become very large. In our computations, 

for the projective plane of order 16 with the 546 x 546 connection matrix each 
determinant is 1086 decimal digits long (these computations were done using 
Wolfram Mathematica 8). This problem can be solved by the computations in 
different finite fields Z pn Di = \A' G \ mod pi, I € {1, ...,£;} for coprime PiS so 

that P = pi ■ P2 ■ ■ ■ Pk > \A l G \- \^g\ e can b e restored due to the Chinese 
remainder theorem. 



3 Computations and experimental results 

As was shown in lj, the graph isomorphism problem can be solved efficiently 
for almost all graphs. The problems arise only considering special cases. The 
most efficient system known for today is Bredan McKay's Nauty (T2]. It is very 
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efficient for most known hard graphs but has exponential running time on special 
family of graphs of bounded degree called Miyazaki graphs [14 . Besides this, 
most known systems have significant problems with the graphs originated from 
projective planes. 

In this section we show the experimental results after applying our methods to 
distinguish some types of hard graphs. 

3.1 Fiirer Gadgets: Miyazaki graphs 

Miyazaki graphs are special cases of a so called Cai-Furer-Immerman construc- 
tion based on Fiirer gadgets ([!]). As an example consider a Miyazaki graph M4 
shown in Fig. [2] (a) . 




(d) (e) 



Fig. 2. Example of a Miyazaki Graph and its parts 

The bold circles represent special subgraphs shown in Fig. [2] (b) — (e). One can 
easily extend this construction to a Miyazaki grapf Mk with arbitrary k. 
Analogously, we define a twisted Miyazaki Graph MT4 2 that is identical with 
M4 up to the 4th pair of bold knots: here two connections are twisted (compare 
Fig. [3]). One can easily generalize this construction to MT n ^ with arbitrary 
n, k € N and k < n. 

Distinguishing M n from MT n ^ is a hard problem for graph isomorphism software 
and is widely used as benchmark tests. The following table shows the running 
times of our program on the Miyazaki graphs of different size. 



() 




Fig. 3. Miyazaki Graph M 4 and twisted Miyazaki Graph MT 4>2 



Size 


Matrix 


Computation 


n 


Dimensions 


Time in sec 


10 


200 x 200 


9 


15 


300 x 300 


26 


20 


400 x 400 


60 


25 


500 x 500 


133 


30 


600 x 600 


1069 


35 


700 x 700 


1940 


40 


800 x 800 


5755 



Remark: The determinants of the original connection matrices of Miyazaki 
graphs and their twisted counterparts are zero, so there is no fast computation 
possible using the inverse matrices. Therefore we changed the corresponding 
connection matrices inserting 3 as diagonal elements. Due to this, for Miyazaki 
graphs with sizes n < 25 faster computation was available. This explains the 
sudden rise in computation time for size 30 and more. 

3.2 Block Designs: Projective Planes 

A projective plane of order n is an incidence structure on n 2 + n + 1 points, 
and equally many lines, (i.e., a triple (P,L,I) where P, L and / are disjoint 
sets of the points, the lines and the incidence relation with I C P x L and 
\P\ = \L\ =n 2 + n + 1), such that: 

— for all pairs of distinct points p,p' £ P there is exactly one line I £ L such 
that (p, I) £ I and (p' , I) £ I; 

— for all pairs of distinct lines 1,1' £ L there is exactly one point p £ P such 
that (p, I) £ I and (p, I') £ /; 

— there are four points such that no line is incident with more than two of 
these points. 

The smallest projective plain is the Fano Plane of order 2 shown in Fig. [4] (a). 
It is also the only plane of this order. 
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We consider the corresponding incidence graphs of different projective plains and 
try to solve the graph isomorphism problem for them. The incidence graph of 
the Fano Plane is shown in Fig. [4] (b). In general, differentiating projective planes 
of the same order by their incidence graphs poses a difficult challenge for graph 
isomorphism algorithms. The web page of Moorhouse [T3] offer a collection of 
known projective planes. 




Hence the computation in Wolfram's Mathcmatica 8 takes very long time for such 
large graphs, the algorithm was implemented in the C programming language. To 
calculate very large numbers while computing 21g or €a the Chinese remainder 
theorem was used. Each dfj k = \A 1 q\ mod pk was computed modulo a prime 
43969 < Pk < 44699 (l G '{1, 2, 72}) and the result was calculated by the 
formula 



72 

\ A G I = J2 d hJ,k m k S k, 
fc=l 

where m k = m/pk, m = p% ■ ■ ■ P72, and s k = mT/ 1 mod pi. 

So, instead of very large numbers, we have to manipulate with lists of relatively 
small numbers: each \^lq\ corresponds to a list of 72 natural numbers that we 
denote by Hq*' = {dfj k \l < k < 72}. In fact, it suffices to compute each 
\Aq\ mod pi (modulo one sufficiently large prime). If the sorted corresponding 
lists Hy> A ^ H^,' 1 (where H^' k = Sart({\A% j \ mod p k \l < k < 72}) we can 
be sure that the graphs G and G' are not isomorphic. Else we get two weighted 
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graphs ( df i k ) and I df- k ) with m = 2(n 2 +n+l), where n is the order 

of a projective plane and repeat the whole algorithm for them. If Hq*' 1 * = H l Q/ k 
for each 1 < k < 72 in the second iteration, the graphs are supposed to be 
isomorphic. 

In general, if H^P' — Hq 1 / for each 1 < k < 72, we can consider the weighted 
graphs as above and proceed with the iteration, but we need a halting criteria to 
decide when the two graphs are isomorphic. In our experiments, the differences of 
most non-isomorphic graphs were discovered in the first iteration step, however 
for some projective planes two iterations were needed. 



Order 


No. of 


Matrix 


Iteration 


Computation 


n 


Planes 


Dimensions 


Steps 


Time in sec 


9 


4 


182 x 182 


2 


63-2=126 


11 


1 


266 x 266 


2 


162-2=324 


13 


1 


366 x 366 


2 


467-2=934 


16 


22 


546 x 546 


1 


5460 


25 


193 


1302 x 1302 


1 


52080 



Note that in some cases (such as for n = 11, 13, 17, 19 or 23) there is only one 
real projective plane, however it is self-dual. 

For n = 27, the determinants for the 1415 x 1415 matrices were calculated on a 
parallel machine with 20 Intel Xeon 2,80GHz CPUfj^j While the computations 
for basic graphs took relatively reasonable time, the computations for their duals 
lasted over twenty times as much for one iteration step (see the table below). It 
is a matter of further research to determine the reason of the break-ins in the 
computational complexity for dual graphs. 



Andre 


Hering 


Sherk 


Flag 4 


Flag 6 


15584.9+89377.6 
104962.5 


38125.7+9757.26 
47882.96 


44448.8+9726.34 
54175.14 


35047.3+9771.57 
44818.87 


34881+9773.86 
44654.86 


Andre Dual 


Hering Dual 


Sherk Dual 


Flag 4 Dual 


Flag 6 Dual 


944676 


976157 


964906 


984884 


1036820 



4 Conclusions and open questions 

In this paper, we have developed an 0(n 5 ) polynomial-time algorithm that com- 
putes graph invariant for graphs with n vertices. The main approach is based 
on random walks on graphs with the probability to stay in the actual node. 
Due to this, we generate a set of n(n — l)/2 quadratic polynomials of one vari- 
able. Comparing these sets we can distinguish different graphs. In some cases, 
two iterations of the algorithm are needed. The experimental results on some 
hard graphs (Miyazaki graphs as special Fiirer gadgets and point-line incidence 
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graphs of finite projective planes of higher degrees) show that our system can 
distinguish non-isomorphic graphs of this kind in reasonable time. It is a matter 
of further research to prove (or disprove) that the given method is a full graph 
invariant and to investigate the question why there are relative break-ins in the 
computational time for some point-line incidence graphs of dual finite projective 
planes of high order. 
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